Accepted for Publication in The Astrophysical Journal, 20 July 2005, v628 

Preprint typeset using I£T^X style cmulatcapj v. 6/22/04 



in 
o 
o 

(N 
u 

< 
(N 



(N 
> 

m 

On 

(N 
O 

in 
o 

6 ■ 
a: 



13 



THE LOW END OF THE SUPERMASSIVE BLACK HOLE MASS FUNCTION: 
CONSTRAINING THE MASS OF A NUCLEAR BLACK HOLE IN NGC 205 
VIA STELLAR KINEMATICS 

Monica Valluri, 1 Laura Ferrarese, 2 ' 3 David Merritt, 4 and Charles L. Joseph 2 

Accepted for Publication in The Astrophysical Journal, 20 July 2005, v628 

ABSTRACT 

Hubble Space Telescope (HST) images and spectra of the nucleated dwarf elliptical galaxy NGC 205 
are combined with 3- integral axisymmetric dynamical models to constrain the mass Mbh of a putative 
nuclear black hole. This is only the second attempt, after M33, to use resolved stellar kinematics to 
search for a nuclear black hole with mass below 10 6 solar masses. We are unable to identify a best-fit 
value of Mbh in NGC 205; however, the data impose a upper limit of 2.2 x 10 4 M© (la confidence) and 
and upper limit of 3.8 x 10 4 M Q (3cr confidence). This upper limit is consistent with the extrapolation 
of the M B h ~ o relation to the M BH < 10 6 M o regime. If we assume that NGC 205 and M33 both 
contain nuclear black holes, the upper limits on Mbh in the two galaxies imply a slope of ~ 5.5 or 
greater for the Mbh — & relation. We use our 3-integral models to evaluate the relaxation time and 
stellar collision time in NGC 205; T r is ~ 10 8 yr or less in the nucleus and T co u s» 10 11 yr. The 
low value of T r is consistent with core collapse having already occurred, but we are unable to draw 
conclusions from nuclear morphology about the presence or absence of a massive black hole. 
Subject headings: galaxies: elliptical and lenticular — galaxies: structure — galaxies: nuclei — stellar 
dynamics 



1. INTRODUCTION 

Evidence for the existence of supermassive black holes 
(SBHs) in galactic centers has increased steadily in the 
past decade. Kinematical detections with varying de- 
grees of quality have been made in roughly three dozen 
galaxies (see Ferrarese & Ford 2004 for a review), and 
virtually airtight evidence exists in two cases: our own 
galactic center (Ghez et al. 2003; Schodel et al. 2003) 
and NGC 4258 (Miyoshi et al. 1995). The measured 
masses obey tantalizingly tight relations with the bulge 
luminosity (Kormendy & Richstone 1995; McLure & 
Dunlop 2002; Marconi & Hunt 2003), the degree of con- 
centration of the bulge light (Graham et al. 2001; Erwin 
et al. 2004), the central velocity dispersion of the stellar 
component (Ferrarese & Merritt 2000) and the velocity 
dispersion on kiloparsec (Gebhardt et al. 2000a), and the 
virial velocity tens-of- kiloparsec scales (Ferrarese 2002). 

These relations are well established at masses exceed- 
ing ~ 1O 7 M , the regime spanned by all but two of the 
current detections. Below ~ 10 7 M Q only two mea- 
surements exist: in the Milky Way (M B h = 4 x 10 6 
Mq, Ghez et al. 2003; Schodel et al. 2003) and in 
M32 (M BH = 2.5 x 10 6 M , Verolme et al. 2002, 
van der Marel et al. 1998). Yet extending the rela- 
tions to the intermediate-mass black hole (IBH) regime 
(M B h 10 6 M q ) is crucial for constraining models of 
black hole formation (Haehnelt, Natarajan & Rees 1998; 
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Cattaneo, Haehnelt & Rees 1999; Monaco, Salucci & 
Danese 2000; Bromley et al. 2004). For instance, the 
simple existence of IBHs would pose a serious challenge 
for models in which nuclear black holes are born in situ 
from the collapse of a "supermassive star" (e.g. Haehnelt 
et al. 1998). It would on the other hand, provide sup- 
port for the alternative scenarios in which SBHs evolve 
from the migration to galactic centers of seed black holes 
produced, for instance, from the collapse of Population 
III stars, or from dynamical processes in ordinary star 
clusters (Giirkan, Freitag & Rasio 2004; Portegies Zwart 
et al. 2004; Islam, Taylor & Silk 2003, Portegies Zwart 
& McMillan 2002; Miller & Hamilton 2002). 

Currently no fully compelling mass determinations ex- 
ist in this regime. The interpretation of the super- 
luminous off-nuclear X-ray sources (ULXs) detected in 
a number of nearby galaxies (Miller & Colbert 2004) is 
under debate. Claimed kinematical detections of IBHs 
in globular clusters (Gebhardt, Rich & Ho 2002; Gerssen 
et al. 2002) and dense star clusters (Maillard et al. 
2004) have generally failed to stand up to close scrutiny 
(Gerssen et al. 2003; Baumgardt et al. 2003a&b; Mc- 
Namara, Harrison & Anderson 2003). Although indirect 
evidence exists for IBHs in Seyfert galaxies (Filippenko & 
Ho 2003; Greene & Ho 2004), a clear dynamical detection 
is still lacking. In galactic nuclei, only one firm upper 
limit, of a few thousand solar masses, has been estab- 
lished for an IBH, in the late-type spiral M33 (Merritt, 
Ferrarese & Joseph 2001; Gebhardt et al. 2001). There is 
a simple reason for this rather unsatisfactory state of af- 
fairs: the dynamical influence of a ~ 10 5 M Q IBH is domi- 
nant within ~ 0.3 pc (the so-called "radius of influence"), 
assuming that the observed scaling of nuclear proper- 
ties with Mbh continues into the IBH regime. At op- 
tical wavelengths and at Hubble Space Telescope (HST) 
resolution (~ 0"1), this restricts the search to galaxies 
closer than ~ 1 Mpc. Several dE and dSph galaxies fit 
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this criterion, but the need to observe objects with high 
enough central surface brightness leaves only one unex- 
amined candidate: NGC 205, a dEn companion to the 
Andromeda galaxy. NGC 205 has been well studied from 
the ground (e.g. Bica et al. 1990; Carter & Sadler 1990; 
da Costa & Mould 1988; Lee 1996; Davidge 2003; and 
references therein) and in imaging mode only, with HST 
(Jones et al. 1996; Cappellari et al. 1999; Bertola et 
al. 1995). Along with M32 and the Sagittarius Dwarf 
(Monaco et al. 2005) it is one of the three spheroidals 
in the Local Group to boast a photometrically distinct 
nucleus (van den Bergh 1999). The dominant stellar pop- 
ulation in the nucleus is known to have formed between 
100 and 500 Myr ago, after star formation had ceased 
in the central region (Lee 1996). The galaxy shows no 
sign of nuclear activity (Ho, Filippenko & Sargent 2003; 
Fabbiano, Kim & Trinchieri 1992; Condon et al. 1998). 
Based on a measurement of the luminosity profile of the 
nucleus, combined with an estimate of the nuclear M/L 
ratio, Jones et al. (1996) estimate a mass for the nucleus 
of 9 x 10 4 M©, which therefore sets an upper limit to the 
mass of any putative nuclear IBH. The relation between 
Mbh and the central stellar velocity dispersion a pre- 
dicts Mbh = 7.4 x 10 4 M Q (adopting a central velocity 
dispersion of a = 39 ± 6 km s _1 ; Peterson & Caldwell 
1993). The influence radius of such an IBH would be 
~ 0.2 pc, barely resolvable by HST at the galaxy's dis- 
tance of 740 kpc (Ferrarese et al. 2000). 

In this paper we present new HST images and spectra 
of NGC 205 (§ El; describe the modeling method and 
derive estimates of the mass-to-light ratios for the inner 
and outer regions (§ |3J| ; construct axisymmetric dynami- 
cal models that are consistent with the kinematical data 
and search for a best-fit value of M BH (§ 0J; discuss the 
implications for black hole scaling relations (§ E}; and 
discuss the constraints that the morphology of the nu- 
cleus place on the existence of a massive black hole (§EJ). 
§0 sums up. 

2. OBSERVATIONS AND DATA ANALYSIS 

2.1. HST/STIS Spectra 

The nucleus of NGC 205 was observed for a total of 
24,132s with the 52"x 0"1 slit of HST's Space Tele- 
scope Imaging Spectrograph (STIS) between 15 October 
2002, and 22 October 2002. The Ca II absorption triplet 
(AA = 8498.06, 8542.14, 8662.17 A) was sampled with 
the G750M grating^ covering the spectral region between 
8275 A and 8847 A with 19.8 km s" 1 pixel" 1 spectral 
sampling (at 8500 A). The STIS CCD pixel scale is 0"05, 
giving a spatial resolution of 0"115. Eleven spectra were 
each divided into a pair of consecutive ("CR-split") ex- 
posures to facilitate removal of cosmic rays; in addition, 
each spectrum was repeatedly shifted relative to the first 
by 2.25, 4.5 and 6.75 pixels along the dispersion direc- 
tion, both to allow for correction of bad pixels and to 
improve the spatial resolution. The spectral calibration 
was performed using the IRAF task CALSTIS following 
the standard procedure outlined in the STIS Data Hand- 
book (version 3.0, Mobasher, Corbin & Hsu 2001). 

Correction for analog-to-digital conversion errors, bias 
and dark subtraction, pixel-to-pixel flat field division, 2D 
rectification and wavelength calibration were performed 
by the IRAF task CALSTIS, using the most up-to-date 
calibration files provided by the Space Telescope Sci- 
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Fig. 1. — STIS spectra obtained at 12 positions along the slit. 
The lowest to frames show STIS spectra for the two template stars 
used in the analysis. 



ence Institute. CALSTIS was also used to combine pairs 
of "CR-split" exposures, performing cosmic ray rejec- 
tion in the process, and to refine the wavelength cali- 
bration using spectral lamp exposures obtained imme- 
diately before each spectrum. The latter step is neces- 
sary to correct for spectral shifts due to positioning by 
the Mode Select Mechanism and thermal motions. In 
addition, each spectrum was corrected for the "fringe" 
pattern which affects observations longward of 7000 A, 
using flat fields obtained immediately before each spec- 
trum, each calibrated (using the same procedure adopted 
for the NGC205 spectra) and normalized. Finally, all 
spectra were combined using the IRAF task DRIZZLE, 
with shifts calculated from the astrometric information 
recorded in the "jitter" files. The actual recorded shifts 
between spectra were found to be within 0''35 of the com- 
manded ones, well within the positioning uncertainties of 
the instrument. 

Spectra were extracted between — 0"266 and +0''289 
from the nucleus and are shown in Figure At each 
location, the line-of-sight velocity distribution (LOSVD) 
was determined by deconvolving the nuclear spectra with 
one or more stellar templates, using the maximum pe- 
nalized likelihood algorithm (Merritt 1997). Once recov- 
ered nonparametrically, the LOSVDs were represented in 
terms of Gauss-Hermite (GH) series (Gerhard 1993; van 
der Marel & Franx 1993). Confidence intervals on all 
quantities were computed via standard bootstrap tech- 
niques. 

Recovery of the LOSVDs requires knowledge of the 
spectrum of a star representative of the entire stellar en- 
semble. In all the galaxies for which STIS stellar kine- 
matical measurements have been obtained from stellar 
absorption lines (e.g. Gebhardt et al. 2003; Pinkney et 
al. 2003; Cappellari et al. 2002), the stellar population 
in the nucleus has been old enough that a single template 
star- generally that of the G8 III giant HD141680 - has 
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been adopted. Although other template stars observed 
with the same instrumental configuration used for the 
NGC 205 spectra exist in the HST archives, all but one 
are of late G or early type K giant stars; the exception 
is HD115617, a main sequence G5 star. As noted above, 
none of these standards is representative of the stellar 
population in NGC 205, raising the possibility that the 
kinematical parameters derived from the spectra are bi- 
ased as a consequence of template mismatch. To explore 
this possibility, we have compared the LOSVDs derived 
using HD115617 (spectral type G8 III, hereafter Tem- 
plate 1) and HD141680 (spectral type G5 V, hereafter 
Template 2) as templates: their spectra are different 
enough (see Fig to allow us to judge the effects of 
template mismatch on the results. 

The first four terms from the Gauss-Hermite expan- 
sion, V, cr, hz, and h±, (van der Marel & Franx 1993; 
Joseph et al. 2001) derived from Template 1 and Tem- 
plate 2 are tabulated in Table 1 and shown in Figure 
as crosses and filled circles respectively (the points from 
Template 2 have been offset slightly along the horizon- 
tal axis to allow for better visual comparison). Although 
the use of Template 1 resulted in a slightly better fit to 
the NGC205 spectrum (as measured via the integrated 
square error), the V and a values from both templates 
are identical to within la statistical errors. The overall 
consistency in the /13 and /14 terms is also quite good: all 
values are consistent with zero. 

The largest discrepancy is seen in the shape of the cen- 
tral (r = -0"012) LOSVD, shown in Figuredl this is par- 
ticularly relevant since it is on the central LOSVD that 
a black hole (if present) is expected to have the strongest 
effect. The noticable asymmetry in the central LOSVD 
obtained when using Template 1 (solid line) is absent 
when Template 2 is used instead (dot-dash line). In an 
axisymmetric galaxy, strong asymmetries in the central 
LOSVD are unphysical, and the asymmetry recovered 
with Template 1 is therefore very likely the spurious re- 
sult of template mismatch. 

In summary, the consistency in the V, a, /13, and /14 pa- 
rameters derived using the two different templates gives 
us some degree of confidence that template mismatch is 
unlikely to significantly affect our results, although we 
should treat results which make use of the full central 
LOSVD with caution. We further note that for stellar 
types earlier than F stars, temperature and rotation in- 
crease the intrinsic width of the lines, and the presence 
of hydrogen absorption lines from the Paschen series can 
further complicate matters as they can overlap with the 
calcium features. In general, using an old stellar template 
to measure the kinematics of a young stellar population 
in this spectral region will cause the velocity dispersion, 
and therefore Mbh, to be overestimated. Since only an 
upper limit to Mbh will be derived in the following sec- 
tions, we conclude that the upper limit could only de- 
crease if the correct stellar template were used. In other 
words, the value derived below should be considered as 
a very firm upper limit to the mass of a central IBH in 
NGC 205. 

Finally, STIS data were complemented at large radii 
with the ground-based kinematic data from Bender, Pa- 
quet & Nieto (1991), obtained with the 3.5m telescope 
at Calar Alto, consisting of 31 apertures within 84.56". 



2.2. HST/ ACS Images 

In order to determine the luminosity distribution in 
NGC 205, the galaxy was observed on 03 October 2002 
for a total of 2,440s with the F814W filter (~ Johnsons' 
/) of the Advanced Camera for Surveys (ACS) High Res- 
olution Channel (HRC). The HRC pixel scale is 0"027, 
giving a field of view of 29"x26". To improve the spa- 
tial resolution, NGC 205 was observed at four different 
pointings, each consisting of two CR-split images, ar- 
ranged at the corners of a square of side 0"0989. To- 
tal exposure times were 2,560s in F555W and 2,440s 
in F814W. Image calibration was performed using the 
IRAF tasks CALACS and PYDRIZZLE following the 
standard procedure described in the ACS Data Hand- 
book (Mobasher, Corbin, & Hsu 2003). Basic reduction 
(bias and dark subtraction, removal of overscan regions, 
flat fielding and cosmic ray removal) was performed us- 
ing IRAF's CALACS, while the task PYDRIZZLE was 
used to correct for geometric distortion, and combine the 
four drizzled images. 

The structural parameters of NGC205 will be discussed 
in detail in a forthcoming paper. Briefly, two indepen- 
dent procedures were adopted to recover the isophotal 
parameters. The IRAF task ELLIPSE (Jedrzejewski 
1987) was employed to find the best fitting isophotes by 
iteratively sampling the image along elliptical paths with 
given semi-major axis length, keeping as free parameters 
the position of the center, ellipticity, semi-major axis po- 
sition angle, and high order coefficients describing devi- 
ations of the isophotes from ellipses. ELLIPSE is best 
applied to galaxies with a smooth surface brightness dis- 
tribution; in the case of NGC 205, the task fails to con- 
verge beyond 0"8, where the stellar population is clearly 
resolved. Fischer et al. (1992) introduced a different 
approach, in which the image is divided in ellipsoidal an- 
nuli, each further divided into eight sectors. The median 
of the average brightnesses determined in each sector is 
taken as the average brightness of each annulus. In the 
case of NGC 205, the center and ellipticity of each annu- 
lus was set equal to the center found by ELLIPSE within 
0"8, and the average ellipticity measured by Kim & Lee 
(1998) and Lee (1996) from ground based data respec- 
tively. 

The photometric calibration was performed using the 
zero points from Sirianni et al. (2004) and a fore- 
ground extinction correction E(B—V) = 0.081 (Schlegel, 
Finkbeiner & Davis 1998), giving A(I) = 0.151 for 
R v = A(V)/E(B — V) — 3.1 and A(I)/A{V) = 0.594 
(Cardelli, Clayton & Mathis 1989). The resulting surface 
brightness profile is shown in Figure^ where we also plot 
the /—band surface brightness profiles measured from 
ground-based data by Kim & Lee (1998) and Lee (1996). 
The ACS surface brightness profiles determined using the 
two methods described above are in excellent agreement. 
Both agree with the ground-based profiles except in the 
innermost ~ 1", where the latter are strongly affected by 
seeing. The final, combined, composite /—band surface 
brightess profile is tabulated in Table 2. 

The 3-dimensional luminosity density j*(xu,z) was 
computed via a non-parametric algorithm (Merritt, Mey- 
lan & Mayor 1997) under the assumption that the galaxy 
is seen edge-on. The inner radius of the surface bright- 
ness distribution 0"025 and the outer radius of the sur- 
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TABLE 1 

HST/STIS Kinematics for the Nucleus of NGC 205 



(1) 



V AV 

(2) (3) 



Template 
a Act 

(4) (5) 



HD141680 
h 3 Ah 3 
(6) (7) 



/14 A/14 
(8) (9) 



V 
(10) 



Template 
AV a Act 
(11) (12) (13) 



HD115617 
h 3 Ah 3 
(14) (15) 



/14 A/l4 

(16) (17) 1 



-0.266 
-0.215 
-0.164 
-0.113 
-0.062 
-0.012 
0.039 
0.090 
0.141 
0.189 
0.239 
0.289 



4.62 7.45 
9.97 5.80 
-1.62 4.30 
-5.87 4.00 
-4.50 4.05 
-0.40 4.30 
0.10 3.60 
-3.19 4.00 
-4.12 5.00 
-3.97 4.95 
-0.19 5.70 
-1.85 6.00 



17.04 
22.12 
15.75 
15.39 
18.14 
20.55 
16.20 
16.60 
17.69 
15.61 
11.73 
10.18 



5.65 
7.05 
3.55 
4.45 
4.65 
4.00 
2.95 
3.55 
4.85 
5.20 
2.80 
3.45 



-0.13 
-0.14 
0.05 
0.03 
0.00 
0.03 
0.06 
0.05 
0.03 
-0.10 
-0.04 
-0.02 



0.13 
0.15 
0.09 
0.05 
0.06 
0.07 
0.08 
0.08 
0.07 
0.07 
0.04 
0.05 



0.06 
0.05 
0.05 
0.06 
0.09 
0.09 
0.04 
0.06 
0.08 
0.06 
0.03 
0.02 



0.05 
0.04 
0.03 
0.03 
0.04 
0.03 
0.02 
0.03 
0.04 
0.04 
0.02 
0.02 



0.47 
13.23 
-1.09 
-5.02 
-5.57 
-0.62 

2.85 
-0.12 

0.97 
-0.72 

0.91 

1.64 



6.75 21 
6.60 28 



3.95 
3.65 
3.85 
3.95 
2.80 
3.70 
5.25 
6.35 
5.35 
5.05 



90 6.20 
16 7.80 

59 2.80 
87 1.75 
46 2.80 
41 3.35 
93 1.95 
52 3.55 
62 4.55 

60 5.20 
22 2.05 
57 1.35 



0.01 
0.00 
0.08 
0.07 
0.08 
0.05 
0.01 
0.08 
0.06 
-0.11 
0.00 
0.00 



0.07 
0.10 
0.03 
0.02 
0.02 
0.06 
0.02 
0.05 
0.06 
0.10 
0.03 
0.01 



0.00 
-0.02 
0.05 
0.04 
0.04 
0.03 
0.00 
0.04 
0.04 
0.07 
0.01 
0.01 



0.04 
0.06 
0.02 
0.01 
0.01 
0.02 
0.00 
0.03 
0.03 
0.06 
0.01 
0.00 



1 Columns are: (1) Aperture distance from the morphological cen- 
ter of NGC 205; (2, 10) velocity (in km s _1 ) relative to the helio- 
centric velocity of NGC205 (-241km s^ 1 ); (3, 11) Ict error on V; 

(4, u 

14) G 

Gausi 



HDU1680 
J HD1 15617- 




Fig. 2. — Nuclear kinematical data (V, ct, /13, /14) along the slit obtained by deconvolving STIS spectra (Fig 1) with two different stellar 
templates HD141680 (crosses) HD115617 (filled circles - which have been offset slightly along the horizontal axis by an arbitrary amount 
to enhance visibility.) 



face brightness distribution is 100". The deprojection is 
performed on a 2D logarithmically spaced grid with 100 
grid points in each dimension. The algorithm accounts 
for the varying cllipticity as a function of radius and com- 
putes j*{xn, z) on the 2D grid by minimizing the resid- 



uals between the projected j„ and the observed surface 
brightness. Since the deprojection is an unstable pro- 
cess, smoothness was enforced via a "thin-plate smooth- 
ing spline" penalty function (Wahba 1990). 

3. DYNAMICAL MODELING 
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V (km/s) 

Fig. 3. — LOSVD in STIS aperture closest to the center (r = 
— 0"012) obtained with the two different stellar templates. Solid 
curve corresponds to deconvolution with Template 1 (a G8 III star) 
and dot-dash curve corresponds to deconvolution with Template 2 
(a G5 III star). The velocity is with respect to the heliocentric 
velocity of the galaxy. The vertical axis is in arbitrary units. 



TABLE 2 

Composite /—band Surface Brightness Profile for NGC 

205 



(1) 



I 

(2) 



(3) (1) (2) (3) 1 



0.025 
0.05 
0.075 
0.1 
0.125 
0.15 
0.175 
0.2 
0.225 
0.25 
0.275 
0.3 
0.325 
0.35 
0.3855 
0.5328 
0.7418 
1.037 
1.454 
2.043 
2.876 
4.051 
5.605 
7.543 
9.157 
13.92 



12.35 
12.48 
12.65 
12.85 
13.07 
13.24 
13.44 
13.62 
13.76 
13.91 
14.09 
14.31 
14.51 
14.67 
14.85 
15.5 
16.24 
16.82 
17.47 
17.76 
17.99 
18.24 
18.36 
18.43 
18.51 
18.56 



0.2286 
0.234 
0.1953 
0.0988 
0.0082 
0.0566 
0.0891 
0.1132 
0.1453 
0.1449 
0.1177 
0.0909 
0.0718 
0.0234 
0.091 
0.091 
0.091 
0.091 
0.091 
0.091 
0.091 
0.091 
0.091 
0.091 
0.091 
0.3 



17.53 
22.91 
24.29 
27.09 
30.01 
33.01 
37.12 
41.45 
44.91 
50.16 
55.6 
61.16 
68.35 
76.43 
85.51 
95.72 
106.3 
118.0 
129.8 
144.1 
158.5 
179.5 
197.4 
217.2 
238.9 
262.8 



18.62 0.52 

18.7 0.29 

18.8 0.27 
18.82 0.29 
18.86 0.3 
18.94 0.31 
18.99 0.33 
19.06 0.34 
19.11 0.35 
19.17 0.34 
19.25 0.36 
19.34 0.36 
19.42 0.38 
19.52 0.4 
19.62 0.42 
19.72 0.43 
19.81 0.45 

19.9 0.46 
19.99 0.46 

20.1 0.47 
20.21 0.47 
20.33 0.5 
20.46 
20.61 
20.78 
20.95 



0.5 
0.5 
0.5 
0.5 



1 Columns are: (1) Semi-major axis length (in arcsec); (2) I— band 
magnitude; (3) Ellipticity. Data for r < 0".35 are derived by ap- 
plying IRAF/ELLIPSE to the ACS/HRC data. Data for 0".3855 
< r < 9" 157 are measured using the radial annuli technique de- 
scribed in §|5] again from the HRC images. Data at larger radii 
are based on ground-based data from Kim & Lee (1998) 



Dynamical constraints on the possible values of the 
parameters, the black hole mass Mbh and the stellar 
M/L ratio T, were computed via orbital superposition 
(Schwarzschild 1979) by constructing oblate spheroidal 



P is- 



□ [ (ACS, ELLIPSE) 

■ [ (ACS, SURFPHOT) 

O I (ground, K&L) 

ft I (ground, Lee) 

A I (WFPC2, ELLIPSE) 

* I (WFPC2, SURFPHOT) 
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Fig. 4. — The I— band surface brightness profile of NGC 
205, obtained by combining the HST/ACS observation discussed 
in this paper to the ground-based data of Kim & Lee (1998) 
and Lee (1996). The radius is the "effective radius", given by 

r = rsMA\/l — e(r), where rsMA is the galaxy semi- major axis, 
and e the ellipticity. 



models for a large number of [Mbh , T] . The observ- 
able properties of the models were then compared to the 
surface brightness and spectral data. With the excep- 
tions detailed in §3.1 and §3.2, the modeling algorithm 
used is the one described in Valluri et al. (2004, hereafter 
VME04) , to which paper we refer the reader for a detailed 
description of the method. The numerically-computed 
orbits were "observed" at each of the 31 ground-based 
apertures and the 12 STIS apertures. For the STIS aper- 
tures, a Gaussian PSF with FWHM of 0"1 was assumed 
(Bower et al. 2001), and the model results degraded ac- 
cordingly. Detailed seeing information is not available 
for the ground-based data, therefore the corresponding 
model results were not PSF-convolved. We have found 
that the omission is unimportant as long as the data 
were obtained with apertures wider than the seeing disk, 
which is indeed the case for the ground-based observa- 
tions used in the analysis. 

A non-negative quadratic programming routine 
(E04NCF of the NAG libraries) was used to find the 
weighted superposition of the orbits that best repro- 
duces both the assumed model stellar density distribu- 
tion p(zu, z) and the observed kinematical data. The to- 
tal number of luminosity constraints in NGC 205 is 304 
(of which the inner 128 are from the ACS photometry); 
kinematical constraints consist of 62 contraints from the 
ground-based data (velocities V and velocity dispersions 
ct within 84"56) and 40 contraints from the STIS data 
(V, a and the GH moments h^^h^) (Note: the last two 
apertures on the right end of the slit in Fig 2 were acci- 
dentally excluded from the dataset). 

Since it is necessary to explore a wide range of the 
parameters values: Mbh, M/L-ratio (typically between 
150-200 pairs of parameters), we run an intitial set of 
models using a total of 4500 orbits per model. Since 
the solution space is sensitive to the size of the orbit li- 
brary (see VME04), we follow these initial runs with a 
limited number of models (keeping either Mbh or M/L- 
ratio fixed) with larger libraries of 8100 orbits each. The 
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CPU time required for a single model with 8100 orbits 
is 7.5 hours on a 2.4GHz Intel Xeon processor makes it 
prohibitively expensive to run a large number of models 
at this resolution). In the next two subsections we dis- 
cuss two aspects in which our models differ from those 
described in VME04. 

3.1. Radially Varying Mass to Light Ratios 

In the standard treatment of the 3-integral modeling 
problem the luminosity density - derived as described in 
§2.2 - is converted into a mass density via a mass-to- 
light ratio T, p*(m,z) = Yj»(ci7, z). In most studies, T 
is assumed to be constant over the entire galaxy. Under 
this standard assumption, our initial models were un- 
able to produce an even remotely reasonable fit to the 
kinematics in NGC 205. This is not surprising since T is 
known to show significant radial variations in this galaxy. 
For instance, Carter & Sadler (1990), using simple King 
models, find Y# = 9.4 for the main body of the galaxy 
(with an uncertainty of up to a factor of 2, especially at 
large radii) and Y# = 2.35 for the nucleus (equivalent to 
I-band values of T/ = 6.95 and T/ = 1.94 respectively 
using standard color corrections). Allowing for a radial 
change in T mandates a cautionary note: the problem 
of determining both Mbh and Y(r) is likely to be de- 
generate, and in principle a variety of possible T profiles 
could undoubtedly be found that fit the observational 
constraints equally well, each producing a different best- 
fit (or upper limit) value for Mbh- 

We produced an independent estimate of Tj in NGC 
205 by using the 3-integral models to fit the luminos- 
ity distributions and the kinematical data separately for 
the nucleus and the main body of the galaxy. The lu- 
minosity density of the nucleus is well approximated by 
a Gaussian with FWHM ~ 0"26 (or a = 0".ll) and the 
nucleus can therefore be regarded as the region within 
r < 0"33 (corresponding to ~ 3cr of the Gaussian fit to 
the luminosity density of the nucleus). The main body 
can be regarded as the region beyond r > 0"55 (~ 5a 
of the Gaussian fit to the luminosity density of the nu- 
cleus) where the color and luminosity profile change over 
to that of the main body of the galaxy. Figure [S] shows 
ID curves of Ax 2 = X 2 — Xmin f° r the nucleus alone 
(r < 0"33, solid line) and for the outer region alone 
(r > 0"55, red dot-dash line) as a function of Yj for 
the case in which no black hole is present. All mod- 
els used a library with 8100 orbits. The horizontal thin 
dot-dash line is at A% 2 = 1. The solid black line sug- 
gests that (Ti) nuc i eus — 1.6 ±0.8 (la uncertainties) and 
C^i)body = 9±2'|, consistent within la with the values 
derived by Carter & Sadler (1990). We point out that 
neither estimate is flawless. The adoption of King pro- 
files is not motivated by current data (or our current 
knowledge of the power law form of luminosity profiles 
of elliptical galaxies). Currently both methods assume 
that it is possible to estimate M/L ratios separately for 
the two regions by treating them as dynamically sepa- 
rate. However in practice they are a single dynamical 
entity and the M/L profile needs to be obtained for both 
regions simultaneously. The most general method for 
finding the radial profile Tj(r) would be to allow M/L 
to vary with radius and obtain a non-parametric best-fit 
estimate for this function for this using a method similar 
to the Schwarzschild method. This beyond the scope of 
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Fig. 5. — ID curves of A\ 2 for the nuclear region (r < 0"33, 
solid line) and for the main body of the galaxy (r > 0"6, red dot- 
dash line) as a function of the I-Band M/L ratio Tj. All models 
assume Mbh = and used 8100 orbits each. The vertical arrows 
show the Tj values obtained by Sadler and Carter (1990) using 
King-models. 



the current paper. 

For lack of stronger constraints, we make the sim- 
plest possible assumption about the radial variation of 
Y, namely we assume Y/ to be constant within 0"33 
(the nuclear region) and beyond 0''55 (the main body 
of the galaxy), and to vary linearly between these two 
values at intermediate radii. Although the values of 
Y derived by Carter & Sadler and those derived using 
the Schwarzchild method are consistent, in the following 
section we will carry out the dynamical modeling using 
both. In the first case (the "Carter-Sadler profile") we 
adopt Yj = 1.94 for r < 0"33, Y/ = 6.95 for r > 0"55 
and linearly-varying between these two radii. In the sec- 
ond case (the "Schwarzschild profile") we adopt Tj = 1.6 
for r < 0"33, Yj = 8.5 for r > 0"55 and linearly- 
varying in between. In what follows we will indicate these 
"primary" profiles as Y}(r), and introduce, in the dy- 
namical modeling, a scaling factor, St = T/(r)/Y}(r), 
as a free parameter. The second free parameter is the 
mass Mbh of the central black hole. For libraries which 
adopted the Carter-Sadler Y-profile we constructed dy- 
namical models for 17 different values of Mbh i n the 
range < M B h < 10 7 M Q and ~ 13 values of S T - For 
libraries which adopted the Schwarzschild Y-profile we 
constructed dynamical models for 15 different values of 
Mbh in the range < M B h < 10 6 M and 13 values 
of S T . 



Valluri et a 



7 



o 




1 2 3 4 5 6 7 




log 1o (M BH /M ) 
Fig. 6.— ID reduced \ 2 (Xr) 

curves as a function of black hole 
mass (Mbh) for 6 different values of the regularization (smoothing) 
parameter A. All models assume Tj = 1.1. See text for further 
details. 



3.2. Regularization 

It is customary to smooth, or "regularize" , the orbital 
solutions, either by including a penalty function with ad- 
justable smoothing parameter (e.g. VME04), by impos- 
ing a "maximum entropy" constraint (e.g. Gebhardt et 
al. 2003) or by imposing a local smoothness constraint in 
phase space (Cretton et al. 1999). As discussed at length 
in VME04, the results can depend strongly on the value 
of the smoothing parameter A. An incorrectly chosen A 
can generate misleading results; for instance, too great 
a degree of smoothing has the same effect as limiting 
the range of allowed orbital populations, and can give a 
spurious, best-fit value of Mbh even when no value is 
preferred by the data. Since the optimal choice of A de- 
pends on the data set in question, we constructed models 
for nine different values of A between 10 -7 and 10. In 
the interest of time, all models used the Carter-Sadler 
T-profiles and libraries of 4500 orbits. 

ID reduced x 2 curves (y |) fo r a fixed Sr = 1-1 (the 
T-scaling factor defined in § 13. 1[) are plotted in Figure 
for six values of A. (xr is computed for DOF=100 as 
discussed later in § refsecxarter-sadler). Most of the 
curves are characterized by a nearly flat valley in \ 2 at 
Mbh < 10 5 M Q . Figure also shows that for smallest 
value of A = 10~ 6 (thin solid curve) the Xr Pl°t * s n °isy 
and oscillates randomly from one model to the next. For 
A = 10~ 4 — 10~ 2 the curves are qualitatively similar: flat 
from Mbh = — 10 5 M© followed by a sharp upturn at 
higher Mbh- For A = 10~ 4 the x R = 0.23 in the flat 
region, for 10 -3 the Xr = 0-62 in the flat region (thick 
solid line) and for A = 10~ 2 the Xr = 1-07. For values 
of A larger than A = 10~ 2 Xr increases above one, and 
the upper limit on Mbh depends more sensitively on A, 
with Mbh decreasing as A increases. We could choose 
either A = 10~ 3 or A = 10~ 2 . We chose A = 10~ 3 as the 
value that gives the largest x% < 1- I n the remainder of 
this paper (unless otherwise noted) this is our preferred 
value of A = 10~ 3 . Other, ostensibly, "objective" meth- 
ods to determine the optimal value of the regularization 
parameter exist but they are either not well motivated 
physically or extremely costly computationally. In the 
interest of limiting the number of parameters varied we 
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Fig. 7. — Contours of constant \ 2 derived from fits of the data 
to 3-Integral axisymmetric models. Dashed contours: fits to the 
Bender et al. (1991) ground-based data. Solid contours: fit to the 
combined ground-based + STIS GH moments derived using Tem- 
plate 1. Each dot represents a set of parameters {Mbh > St) f° r 
which a 3-Integral dynamical model was constructed. The total 
number of orbits used in each case was N a = 4500. The first 3 
contours of each type are at la, 2a, 3a confidence intervals respec- 
tively (68.5%, 90% and 95%). The regularization parameter was 
A = 10" 3 . 



use of A = 10~ 3 also for models using libraries of 8100 
orbits. Our quoted results of the upper limit on Mbh 
and choice of Sr are all for this value of A, but this does 
not imply that other values of A give statistically less 
probable results. 

4. RESULTS OF 3-INTEGRAL MODELING 

4.1. The Zeroth- Order Model 

We begin our analysis by discussing the results of the 
dynamical models under the following conditions: 1) all 
models use a 4500 orbit library; 2) the regularization pa- 
rameter A is set equal to 10 -3 ; 3) the TJ profile is that 
given by Carter and Sadler (1990) (§3.1); and 4) the 
STIS kincmatical information is extracted using Tem- 
plate 1 (§2.1). In the next subsection we will change 
some of these conditions and investigate how the results 
arc affected. 

Figure[3shows 2-dimensional contour plots of the total 
X 2 as a function of S r = T 7 (r)/TJ(r) and Mbh- The 
models covered 17 values of Mbh and 13 values of Sr (in- 
dicated by the grid of points); the contours are plotted 
at A(x 2 ) = 2.30,4.61,6.17,9.21,11.8,18.4 which corre- 
spond to 68.3%, 90%, 95.4%, 99%, 99.73%, 99.99% con- 
fidence intervals assuming two degrees of freedom (DOF) 
(but see note below). Beyond the 6th contour the spacing 
of contours is arbitrary. Two sets of contours are shown, 
depending on which set of kincmatical constraints are fit- 
ted. While every model is required to fit the luminosity 
(mass) constraints (for a total of 304 constraints), the 
dashed (red) contours correspond to models which were 
required to fit the ground-based kincmatical data only (a 
total of 62 constraints). The solid (black) contours show 
the results when the fit is also required to be constrained 
by the STIS data (V, a, and hi) for an additional 40 
constraints. 

Both sets of x 2 contours are striking in their lack of 
dependence on Sr , which is determined from these plots 
to be 1.13±q " 8 (lc uncertainty). Although a minimum 
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Fig. 8. — ID x 2 curves which illustrate the dependence of the 
upper limit on Mbh on orbit library size and on smoothing pa- 
rameter A. Individual curves are described in the text. 



in the \ 2 contours is not seen, the addition of the small- 
radius spectral data from STIS allows us to reduce the 
upper limit on Mbh by an order of magnitude, as shown 
more clearly in Figure |H1 The curves labeled A, B are 
ID cuts through Figure at the minimum in Sy = 1.1: 
curve A is a fit to the ground and STIS data and gives an 
upper limit of Mbh ~ 8 x 10 3 M Q while curve B is the fit 
to the ground-based data alone and gives an upper limit 
of Mbh ~ 1O 5 M0 (both upper limits are given at the 
la confidence level, shown by the the horizontal dotted 
lines drawn above each \ 2 curve). Note that the ID \ 2 
plots shown in Figure|S] are not marginalized with respect 
to Sy, but are merely cuts through the 2-parameter \ 2 
plots. (Curves C and D of Fi^S] arc described in the 
following section.) 

A well known assumption of the use of the \ 2 statis- 
tic is that the problem is linear in the parameters being 
estimated. An additional unstated assumption in all dis- 
cussions of confidence interval estimation (e.g. Press et 
al. 1992) is that the number of fitted parameters is less 
than the number of data points - in other words, that the 
parameter estimation problem is over-constrained by the 
data. It is only in this case that quantities like "num- 
ber of degrees of freedom" (DOF=A r data - ^parameters) 
make any sense. But in Schwarzschild modeling, the 
number of parameters (orbits + potential parameters) 
is typically far greater than the number of data points, 
i.e., the problem is under-constrained by the data. Indeed, 
one typically finds that a variety of choices for the pa- 
rameters (MbhiY) can reproduce the data equally well 
(VME04), and if it were not for positivity constraints on 
the orbital occupation numbers, this degeneracy would 
be even greater. We know of no discussion in the statis- 
tical literature that deals adequately with this situation, 
and for this reason, the standard prescriptions for esti- 
mating confidence regions (e.g. Press et al.) should be 
considered suspect when applied to Schwarzschild mod- 
elling. 

It is therefore necessary to for us to first clarify how the 
confidence intervals are computed. The quality of a par- 
ticular fit was assessed by comparing the total \ 2 value 
of the fit to the number of degrees of freedom (DOF), as 
being "the number of independent data points - number 
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of potential parameters" . Thus the number of parame- 
ters is assumed to be 2 (while actually it is much larger 
due to the orbits). For the NGC 205 dataset, the num- 
ber of independent data points is 102 (for the STIS plus 
ground-based kinematical data) or 62 (for the ground- 
based kinematical data only; note that the luminosity 
constraints are "model constraints" and not "data con- 
straints"), while the number of parameters is two (Sy 
and Mbh), giving DOF=100 (STIS+ground based data) 
or DOF=60 (ground-based data only). This DOF was 
used in § 13.21 to compute reduced-x 2 (Xr) which is ex- 
pected to be ~ 1 for a good fit. It has become customary 
in this field to further assume that the confidence inter- 
vals on the parameters being estimated (namely Mbh 
and T) can be obtained from A(% 2 ) distribution with 
the number of degrees of freedom = to the number of 
parameters. 

Note also that unless otherwise stated the upper limits 
quoted are la confidence limits. In a few instances we 
also quote 3a limits. Also we caution that the contour 
plots are not reliable for determining upperlimits since 
they are plotted with an algorithm which uses adaptive 
2D regression - we always compute upperlimits from ID 
cuts through the \ 2 plots. It is also worth noting that 
when judging whether, for instance, a model with a black 
hole is preferred over a black-hole-free model, we urge 
the skeptical reader to follow our practice of closely in- 
specting the kinematical profiles predicted by the differ- 
ent models and deciding whether either model is signifi- 
cantly better at reproducing the data in the region where 
the black hole's influence would be felt. 

4.2. Robustness of the Results 

In this section we explore the dependence of the results 
obtained above on 1) the size of the orbit library; 2) the 
choice of the smoothing parameter A; 3) the stellar tem- 
plate used in recovering the kinematical data from the 
STIS spectra; and 4) the choice of T profile. The results 
of all models discussed in this section are summarized in 
Table 3. 

We consider first the size of the orbit library. Curves 
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Fig. 9. — Comparison of x 2 contours for models that fit the STIS 
GH moments obtained using the two different stellar templates. 
Solid (black) contours are for models that fit kinematics derived 
using Template f and dot-dash (red) contours are for models that 
fit kinematics from Template 2. 
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Fig. f 0. — lD-x 2 curves obtained by fitting the central LOSVD 
and the Gauss-Hermite moments for other apertures. The solid 
line is from fitting the central LOSVD and GH moments for other 
apertures obtained with Template 1 and the dot-dash line is from 
fitting the kinematics obtained with Template 2. 



C, D in Figure are analogous to curves A, B ex- 
cept they were derived from models which used 8100 
orbits. In the interest of time, these models were run 
at Tj(r)/T|(r) = 1.1, for 17 different values of M BH - 
It is evident from the figure that increasing the size of 
the orbit library relaxes the upper limit on Mbh, at 
least when both the STIS and ground based data are 
fitted. Using the ground based-data only, we now de- 
rive an upper limit of Mbh — 6.4 x 10 4 M Q (curve D), 
while adding the STIS constraints produces an upper 
limit M B h < 2.2 x 10 4 M Q (curve C) (at la confidence) 
and an upper limit of Mbh ^ 3.9 x 10 4 M Q (at 3a con- 
fidence). This sensitivity to the size of the orbit library 
was discussed in VME04, where we advocated using the 
largest orbit library that was computationally feasible 
to obtain the full range of statistically allowed solutions. 
(We do not believe that the decrease in upper limit of 
Mbh derived from the ground-based data is statistically 
significant. An examination of the contribution to x 2 
shows that this number is sensitive to the the noise in 
the large radius data which has no bearing on the mass 
of a central black hole.) 

As mentioned earlier, the choice of A depends on the 
size of the orbit library. For the 4500 orbit models, A = 
10~ 3 was identified in §3.2 as the best value based on 
the fact that this gave a Xr closest to, but below one. 
The effect of changing A for this larger orbit library is 
shown Table 3 (Run No. 4-7). When the full dataset 
(STIS+ground based) is fitted with a smaller smoothing 
parameter A = 10~ 4 , the la upperlimit increases by 60% 
to M BH < 3.5 x 10 4 M Q (with x\ ~ 0.23. When the 
same dataset is fitted with a larger smoothing parameter 
A = 10~ 2 , the upperlimit deccreases by 50% to Mbh ^ 
1.1 x 10 4 M o (with x R ~ 0.9. And when the dataset 
is fitted with A = 10 _1 , the upperlimit decreases even 
further to M BH < 0.77 x 10 4 M Q (with reduced x 2 ~ 1.4. 

As A increases, the fit to the data becomes smoother 
and as a consequence x 2 also increases. We know of no 
truly robust way to pick smoothing such that it does not 
strongly influence the derived estimate of Mbh- In the 
absence of a robust choice of A we simply quote the value 
obtained for A = 10~ 3 for reasons given in § 13.21 



The effect of using a different template in extracting 
the kinematical constraints from the STIS data (§2.1) is 
shown in Figure [5] This figure is directly comparable to 
Figured since it was produced using a 4500 orbit library 
and A = 10~ 3 . As expected based on the consistency 
between the V, a, h%,h± values shown in Figure|21 the fits 
to the kinematics using the two stellar templates give 
virtually identical limits on both Sx and Mbh- 

However, as discussed in §2.1 and shown in Figure |3 
the two different templates do produce significantly dif- 
ferent shapes for the LOSVD in the innermost apertures. 
While these differences are disconcerting, it is neverthe- 
less desirable to try to include the extra information con- 
tained within the central LOSVD, since it is here that 
the effect of a central black hole would make its presence 
most strongly felt (e.g. van der Marel 1994). Earlier au- 
thors (e.g. Gebhardt et al. 2000b) have modeled even 
strongly asymmetric central LOSVDs in the detection of 
a nuclear black hole with axisymmetric models. These 
authors attribute the asymmetry in the central LOSVD 
to an off-centered central aperture as well as due to the 
presence of obscuring dust. 

Figure shows two lD-x 2 plots obtained by fitting 
the full LOSVD (sampled at AV" = 20km s" 1 ) for the 
central STIS aperture, and the Gauss-Hermite moments 
for other apertures. The solid and dot-dashed lines refer 
to fits to the kinematics derived using Template 1 and 
Template 2 respectively; as before, the two horizontal 
dot-dash lines represent the la confidence limit above the 
minimum of the corresponding curve. All models used or- 
bit libraries with 8100 orbits and a smoothing parameter 
A = 10~ 3 . Although the models that fit kinematics from 
Template 1 (solid line black) give a systematically higher 
X 2 because of the difficulty in fitting the highly asym- 
metric central LOSVD, both LOSVDs produce identical 
upper limits on Mbh of 2.2 x 10 4 Mq - which is also 
identical to that obtained from fitting the GH moments 
for all the apertures (curve C in Fig[7|). The insensitiv- 
ity of the results to whether the full LOSVD is fitted or 
not, is likely to be a result of the fact that the sphere of 
influence of a putative central black hole in NGC 205 is 
not resolved, and therefore the number of high velocity 
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stars which are able to affect the high velocity wings of 
the LOSVD is proportionately small. While the LOSVD 
from Template 1 might point to a detection of heavy 
wings, the increased difficulty of fitting the asymmetry 
with an axisymmetric model does not appear to result in 
an increase in the upper limit. 

We conclude this section by discussing how the esti- 
mate of Mbh is affected by choice of T profile. We would 
have preferred to do this for the models with 8100 orbits; 
however because of the modeling time involved, we first 
ran a full set of models with the smaller (4500) orbit li- 
brary, varying both St and Mbh, and then ran a subset 
of the larger, more time-intensive models at the value of 
St corresponding to the minimum found from the previ- 
ous set of models. Models using 4500 orbits were there- 
fore constructed using the Schwarzschild T profile for 14 
different values of Mbh from 0.1 to 10 5 M Q and 13 val- 
ues of St- The models were then fitted to the luminosity 
(mass) constraints, plus the GH moments obtained us- 
ing Template 1 . A two-dimensional x 2 plot for this set of 
models is shown is Figure ITlT b'k the two-dimensional \ 2 
plot with models that used the Carter-Sadler profile is 
reproduced again in Figure UlT a) for comparison. As be- 
fore, the three inner most contours represent the lcr, 2a 
and 3cr (A X 2 = 2.3,4.6,6.1 for DOF=2) confidence in- 
tervals respectively. In Figure 1121 we plot cuts through 
Figures UTT a^ (solid curve) and Figures UTT b) (dot-dash 
curve) for a fixed Mbh = 10 2 . The Schwarzschild 
T profile models show a minimum at St = 0.95±g'o§ 
(Fig. ITTTb)) (as opposed to St = 1.13±g;? 8 in the case 
of the Carter-Sadler profile (Fig. II If all. The intercepts 
with the horizontal dot-dash line indicate the la confi- 
dence intervals. (Note that a few additional models were 
run in each case to fully explore the minium). Figure El 
shows that the best fit values of St for the two mod- 
els are significantly different at the la level. Values of 
St close to the minima in Fig El were then chosen to 
construct a subset of models with 8100 orbits for all val- 
ues of Mbh- The ID \ 2 profiles resulting from these 
larger orbit libraries are shown in Figure 1131 as a dot- 
dashed curve (the results using the Carter-Sadler pro- 
file are shown as a solid curve for comparison). The 
Schwarzchild profile gives Mbh < 1.3 x 10 4 M Q , com- 
pared to Mbh < 2.2 x 10 4 M Q for the Carter-Sadler 
profile. 

Finally, we note that the overall \ 2 produced by the 
best fit models for the two T profiles are virtually in- 
distinguishable from each other (the minima differ by 
A\ 2 ~ 1), and therefore there is no reason to prefer one 
profile over the other. This is an indication that the 
3-integral modeling problem is inherently degenerate as 
discussed at length in VME04. 

From the discussion above we conclude that 2.2 x 
1O 4 M0 represents a firm upper limit to the mass of 
the central black hole in NGC 205 (Table 3). Fig- 
ure El shows the model fits to the kinematical data for 
St = 1.1 (for models obtained with the Carter-Sadler 
profile and 8100 orbits per library) and four different 
black hole masses: no black hole (dot-dash line), Mbh = 
1.1 x 1O 3 M (dashed line); M BH = 2.2 x 10 4 M Q (solid 
line, corresponding to the la confidence upper limits for 
models with 8100 orbits and A = 10~ 3 ), Mbh = 5 X 10 4 
M Q (dotted line, corresponding to the 5a limit for the 
same model). The plot confirms that the two smaller 



values of Mbh produce statistically equivalent fits to 
the data, and that the central STIS velocity dispersion, 
a ~ 21km s _1 , provides the strongest constraint on the 
upper limit on the mass of a massive central black hole. 

5. IMPLICATIONS FOR BLACK HOLE SCALING 
RELATIONS 

While the firm dynamical detection of an IBH has yet 
to be achieved in any galaxy, there are now two se- 
cure upper limits, in M33 (Merritt et al. 2001; Geb- 
hardt et al. 2001) and in NGC 205 (this paper). Here 
we investigate the implications of these upper limits for 
the form of the scaling relations that have been estab- 
lished using SBH detections in more massive galaxies. 
The tightest of these relations is the Mbh — & rela- 
tion as derived from central aperture dispersions (Fer- 
rarese & Merritt 2000). This relation is plotted in Fig- 
ure El as updated by Ferrarese & Ford (2004), using 
only those galaxies (25 in number) with secure dynami- 
cal detections. The solid line in Figure El shows the best 
fit regression line: M BH ,s = (1-66 ± 0.24)cr 2 oo 4 - 86±a43 
(M BH ,s = M BH /10 S M G , a 2Q0 = a/200 km s" 1 ). Also 
plotted are the upper limits on Mbh in M33 and NGC 
205. For M33, we conservatively adopt M B h < 3000 
Mq (Merritt et al. 2001), rather than the smaller value 
of 1000 M Q claimed by Gebhardt et al. (2001). 

Also shown (dotted line in Figure 115(1 is the relation 
claimed by Tremaine et al. (2002), using a velocity dis- 
persion measured by those authors within one effective 
radius. The difference in slopes between the two rela- 
tions is barely significant in a statistical sense, but be- 
comes critical when the relations are extrapolated to the 
low mass regime. The upper limits on the IBH masses in 
M33 and NGC 205 are both inconsistent with the shal- 
lower Mbh — o relation advocated by Tremaine et al. 
(2002) but not with the Ferrarese & Ford (2004) or Fer- 
rarese & Merritt (2000) relations. 

Given the importance of determining whether these 
upper limits on Mbh are consistent with the scaling re- 
lations determined at higher masses, we must address the 
issue of the slope. Among the differences in the way that 
the two groups construct the Mbh — & relation, proba- 
bly the most important is sample definition. Gebhardt 
et al. (2000a) and Tremaine et al.(2002) adopt a less 
restrictive criterion than Ferrarese & Merritt (2000) in 
establishing the reliability of a SBH detection; in partic- 
ular, they include several detection based on data which 
do not resolve the SBH sphere of influence (comprising 
roughly 30% of the claimed SBH detections based on stel- 
lar kinematics), and others (e.g. NGC 3379; Gebhardt et 
al. 2000b) for which the authors themselves acknowledge 
that a model with no SBH fits the data precisely as well 
as a model containing a SBH. Tremaine et al. (2002) also 
include SBH mass estimates which are deemed by their 
own authors to have systematic uncertainties that exceed 
the quoted errors (NGC1068, Greenhill et al. 1996; NGC 
4459, NGC 4596, Sarzi et al. 2001; NGC224, Bacon et al. 
2001). Empirically, the scatter in all SBH scaling rela- 
tions is seen to decrease, sometimes dramatically, when 
detections based on galaxies which do not resolve the 
sphere of influence are removed from the sample (Mer- 
ritt & Ferrarese 2001; Ferrarese & Ford 2004; Graham et 
al. 2001; Marconi h Hunt 2003). The slope is also found 
to increase when such detections are removed from the 
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Fig. 11. — Comparion of \ 2 contour plots obtained by fitting data with libraries (of 4500 orbits) constructed with the two different T 
profiles: (a) for orbit libraries that used the Carter-Sadler profile; (b) for libraries constructed with the Schwarzschild profile. 




Fig. 12. — Cuts through 2Dx 2 contour plots in Fig^at M B H = 
10 2 Mq for the Carter-Sadler profile (solid black curve); for the 
Schwarzschild profile (dot-dash red curve). 
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Fig. 13. — Comparion of ID \ 2 curves at fixed values of St 
obtained by fitting data with libraries of 8100 orbits for libraries 
that used the Carter-Sadler profile (solid curve); for libraries con- 
structed with the Schwarzschild profile (dot-dash curve). 



Mbh — o~ relation, such as that derived by Ferrarese & 
Merritt (2000) and Ferrarese & Ford (2004), is more ap- 
propriate. Therefore, the upper limits on the masses of 
nuclear IBH in M33 and NGC 205 are not inconsistent 
with the extrapolation of the Mbh — & relation to the 
low-mass regime. Put another way, while there is still no 
compelling dynamical evidence for IBHs in these galax- 
ies, we can not yet rule out the presence of IBHs with 
masses similar to those predicted by the established scal- 
ing relations. 

Rather than examine whether the upper limits on 
Mbh in M33 and NGC 205 are consistent with the 
Mbh — o relation, we can rederive that relation under 
the assumption that it applies to both galaxies, using 
the upper limits on Mbh as constraints. We apply a 
technique called "regression with censored data" that is 
now standard amongst statisticians and has been applied 
in a few astronomical contexts (e.g. Isobe, Feigelson & 
Nelson 1986). Two censored regression methods are in 
wide use: the EM algorithm (Dempster, Laird & Ru- 
bin 1977), a maximum likelihood method that assumes 
a normal distribution of the ordinate values about the 
best-fit line; and a modification of the EM algorithm 
by Buckley & James (1979), hereafter the BJ algorithm, 
which infers the Y"-distribution from the measurements 
themselves and is believed to be more robust to non- 
normal errors. Unfortunately, neither algorithm takes 
into account measurement errors in either variable, 5 and 
for neither is there a well-accepted scheme for estimating 
the uncertainties in the fitted coefficients. In addition, 
these algorithms assume that the upper limits are pre- 
cisely known. 

Table 4 gives coefficients in the fit of the data in Figure 
14 to 

logic, M BH,8 = OL log 1Q (7200 + 0- (1) 

We used the implementations of the EM and BJ algo- 
rithms provided by Isobe et al. (1986). The first line in 
Table 4 gives the slope (4.52 ± 0.36) and intercept when 
M33 and NGC 205 are excluded. The slope is slightly 



sample (Ferrarese & Ford 2004). 
Our conclusion is therefore that a steeper slope to the 



5 This was also the case for the regression algorithm used by 
Gebhardt et al. (2000a) in their derivation of the slope of the 
Mbh ~ a relation. 
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orbits for 4 different value of Mbh as indicated by the labels. The solid line is the fit for the upper limit Mbh = 2.2 X 1O 4 M0. 



lower than the value (4.86) quoted above; the latter was 
derived from a regression algorithm that accounts for 
measurement errors (in general, ignoring measurement 
errors leads to spuriously low estimates of the slope). 
The remaining lines give the fitting parameters from the 
EM and BJ algorithms, respectively, under two assump- 
tions about the upper limits on Mbh in M33 and NGC 
205. Including the two upper limits always increases the 
inferred slope, to values in the range 5.2 < a < 5.7. Had 
we been able to include measurement errors in the cen- 
sored regression algorithms, these slopes would probably 
have been even greater. We consider these values to be 
the best current estimates of the slope of the Mbh — & 
relation, under the assumption that the relation extends 
to spheroids as faint as M33 and NGC 205. 



TABLE 4 



Fits to log Mbh,% = <* log °2oo + P 


M BH (M33) Mbh(NGC205) 


a 


P 


< 3000 

< 1000 

< 3000 

< 1000 


< 10,000 

< 10, 000 

< 35, 000 

< 35, 000 


4.52 ± 0.36 
5.48(5.43) 
5.67(5.58) 
5.29(5.23) 
5.52(5.43) 


0.22 ±0.06 
0.17(0.17) 
0.16(0.17) 
0.18(0.18) 
0.17(0.17) 



6. MORPHOLOGICAL CONSTRAINTS ON THE PRESENCE 
OF A MASSIVE BH IN NGC 205 

The presence or absence of a massive black hole in 
NGC 205 might be inferred indirectly, from the observed 
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Fig. 15. — Mbh — o" relation. Solid line is the best fit from 
Ferrarese & Ford (2004), with lcr errors on the slope shown by the 
dashed line. The dotted line is the best fit from Tremaine et al. 
(2002). Only masses based on data which resolve the sphere of 
influence are plotted; the stellar velocity dispersion <r is as defined 
in Ferrarese & Merritt (2000). 



structure of the nucleus combined with evolutionary ar- 
guments. We begin by contrasting the structure of NGC 
205 with that of the other well-resolved Local Group 
spheroids, in M31, M32, M33, and the Milky Way (Lauer 
et al. 1992, 1998; Genzel et al. 2003). Each of these 
galaxies exhibits a steep luminosity profile, p ~ r 1 , 
1-5 ^ 7 ;$ 2, inward of ~ 1 pc, with densities at 0.1 
pc that range from - 1O 6 M pc" 3 (M33) to - 1O 7 M 
pc~ 3 (M32). Only M33 and NGC 205 exhibit a core, 
with radius ~ 0.2 pc in both galaxies. The inferred cen- 
tral density of M33, assuming M/L v = 0.4 in solar units, 
is ~ 2 x 10 6 M Q pc~ 3 (Lauer et al. 1998), compared with 
- 3 x 1O 5 M pc" 3 in our mass model of NGC 205. M33 
and NGC 205 also have similar central kinematics: the 
ID central rms velocity is ~ 20—30 km s _1 in both galax- 
ies, compared with much higher values at the centers of 
the other Local Group galaxies. 

The lack of a dynamical detection of a massive BH in 
NGC 205 also implies an upper limit to its gravitational 
influence radius, r% < 0.16pc Mbh, 4&20 (Mbh,4 = 
M BH /lO 4 4Af , CT 2 o = ct/20 km s -1 ). While no clear 
morphological signature is apparent at in any of the 
other Local Group galaxies, even those belived to contain 
massive BHs, the power-law density cusps in M32 and 
the Milky Way are at least consistent with the p ~ r~ 7 / 4 
density profile that is established around a BH on a re- 
laxation time scale T r (Bahcall & Wolf 1976), where 

T - = r2 M<T ! a * 1A X 10 V4,P5 1 (InAxor 1 (2) 
G^m+p in A 

(p 5 = p/lO 5 M pc" 3 , lnAio = In A/10.) The outer ra- 
dius of such a cusp is expected to be ~ 0.2r/j (Preto, Mer- 
ritt & Spurzem 2004), making it unobservable in NGC 
205 even if we adopt our upper limit on Mbh- 

Figure plots T r as a function of radius in NGC 205; 
we also plot the time scale T co u for star-star collsions, 

T coU = [WV^narl (1 + 9)] _1 w 8.5x 10 10 yr^r>5 '©Joo 

(3) 



where is the stellar number density in units of 10 5 
stars (Mq) per pc 3 , and r* are a typical stellar mass 
and radius respectively, and 6 = Gm*/(2cr 2 r+). We eva- 
luted these two time scales as functions of radius us- 
ing kinematical quantities taken from our Schwarzschild 
solutions, setting a 2 — {a 2 + a 2 + o^)/3, m* = M , 
r* = Rq, and In A = 10. The results were found to 
be almost the same whether the assumed BH mass was 
~ or 2.2 x 10 4 M ; Figure EH shows the results for 
Mbh — 0. While the stellar collision time is not directly 
relevant to the issues addressed in this Section, we note 
that T co u is sufficiently long even at 0.1 pc in NGC 205 
that a typical solar-type star is unlikely to have suffered 
a collision in its lifetime. In this respect NGC 205 is 
similar to other Local Group spheroids, all of which have 
Icozz(O.lpc) of order 10 11 yr. Hence we ignore collsions in 
what follows. Collisions might nevertheless have played 
a role in establishing the M/L gradient in NGC 205, a 
subject that will be discussed in more detail in a later 
paper. 

Figure ^| shows that the relaxation time in NGC 205 
is quite short, with a central value of ~ a few xlO 7 yr. 
Among the Local Group spheroids, only M33 has a com- 
parably short central relaxation time, T r S3 5 x 10 6 yr. 
The low value of T r in M33 prompted the suggestion 
(Hernquist, Hut & Kormendy 1991) that the nucleus of 
this galaxy might have undergone core collapse. In the 
remainder of this Section, we discuss whether a similar 
case can be made for NGC 205, and whether the changes 
induced in the nuclear morphology of NGC 205 by core 
collapse would depend on the presence of a massive BH. 

The case for core collapse in M33 is based on its short 
central relaxation time. If core collapse has not already 
occurred in M33, it would be expected to take place in 
a time T cc S3 10 2 T r (0) S3 3 x 10 8 yr, with T r (0) the cur- 
rent central relaxation time. Unless we live at a special 
time, it is therefore very likely that the M33 nucleus has 
already undergone core collapse. The constant-density 
core of M33 might be a result of the binary-driven re- 
expansion that takes place following collapse. 

This argument is not quite as strong in the case of 
NGC 205. We note that the exact constant of propor- 
tionality between T cc and T r (Q) depends on the details 
of the stellar distribution function, and can vary from 
~ 10 to ~ 10 3 (Quinlan 1996). However the nuclear 
density profile of NGC 205 is reasonably close to the 
self-similar, p oc r~ 2 - 2 form of late core collapse, and 
the current central relaxation time is short enough that 
considerable evolution toward core collapse must have al- 
ready occurred. In these circumstances, the constant of 
proportionality should be close to its asymptotic value, 
T cc S3 330T r (0) (Spitzer 1987), and the current central 
value of T r implies that the time remaining to core col- 
lapse (if it has not already occured) is ~ 1.6 x 10 10 yr. 
This time could be shortened somewhat if the central 
density rises inward of the resolution limit, or if the nu- 
cleus contains a range of stellar masses (e.g. Giirkan et 
al. 2004). However T cc is not short enough that we can 
conclude, as in M33, that core collapse has definitely oc- 
curred, and this makes it more difficult to draw definite 
conclusions about the effects of a massive BH on nuclear 
evolution. 

If a massive BH were present at the center of NGC 
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Fig. 16. — Relaxation time T r and stellar collision time T co ;; as 
functions of radius along the major axis in NGC 205, in the model 
solution with Mbh = 0. See text for details. 



205, how would this affect the evolution toward core col- 
lapse or its subsequent effects on nuclear morphology? 
The theory of core collapse in the presence of a BH has 
not been as widely developed as the theory of core col- 
lapse in a purely stellar system, and most discussions 
make particular assumptions about the efficiency of ac- 
cretion of gas from tidally-disrupted stars and its effect 
on the growth of the BH (e.g. Shapiro 1977; Murphy, 
Cohn & Duriscn 1991; Baumgardt et al. 2004). In its 
early stages, core collapse around a BH is driven by the 
same evaporation of high-velocity stars that drives clas- 
sical core collapse (Hcnon 1961). Collapse is eventually 
halted, either by binary formation, or by the input of 
heat due to capture or disruption of stars by the BH. 
The latter process is expeced to dominate the former 
at densities like those of the nucleus of NGC 205 (Mur- 
phy, Cohn & Durisen 1991); the core then re-expands, 
on a time scale of order the stellar consumption time, or 
<~ T r (rVi). Outside of the BH's sphere of influence, the 
density profile in a post-core-collapse nucleus is probably 
similar to that in a nucleus without a BH (e.g. Fig. 8b of 
Murphy ct al. 1991). Significant differences in the rate of 
core collapse or its effects on the structure of the nucleus 
would only be expected if the BH were large enough that 
stellar capture or disruption dominated from the outset 
(Shapiro 1977). 

We conclude that the structure and kinematics of NGC 
205 do not allow us to make a definitive statement about 
whether or not there is morphological evidence for a cen- 
tral SBH. Core collapse may have occurred, but it is also 
possible that the core is still evolving toward collapse. 



dwarf elliptical companion of the Andromeda galaxy. 
The surface brightness profile derived from the /—band 
images was combined with large scale ground based data 
from Lee (1996) and deprojected non-parametrically to 
derive the 3-D luminosity density. The kincmatical in- 
formation (velocity, velocity dispersion and the GH /i 3 
and hi coefficients), extracted from the STIS spectra 
by means of a maximum penalized likcliwood algorithm, 
does not seem to depend significantly on the choice of 
stellar template. The STIS spectra are complemented at 
large radii with data from Bender et al. (1991). The 
full suite of data provides both luminosity and kinematic 
information up to 1.4 arcmin from the center (305 pc at 
the galaxy distance of 740 kpc). 

In an effort to constrain the presence of a central 
black hole, state-of-the-art 3-integral dynamical mod- 
els based on orbital superposition were constructed for 
a large number of the parameters [Mbh, St], where 
Sr = T/(r)/Tj(r) controls the scaling of the radially 
variable stellar M/L ratio relative to a "nominal" value. 

Several sets of models were computed and compared 
with the data, under different assumptions regarding the 
number of orbits, the value of the regularization param- 
eter, and the radial dependence of the mass-to-light ra- 
tio. Model predictions were compared with two sets of 
observed kinematics, derived from the STIS data using 
different template spectra. We also compared the con- 
straints imposed with and without the inclusion of the 
central LOSVD. 

Under no set of assumptions were we able to recover 
a best- fit value of Mbh in NGC 205. The upper limit 
which we derive on Mbh is somewhat dependent on the 
assumptions made in the modelling, as expected based on 
earlier work (VME04). We found upper limits ranging 
from ~ 0.8 x lO 3 M to ~ 3.5 x 10 4 M Q . Even for the 
largest of these upper limits, the radius of influence of 
the black hole would not be resolved by our data, hence 
the failure of the modelling to produce a firm detection 
is not surprising. An upper limit of 2.2 x 10 4 M Q for the 
black hole in NGC 205 - our preferred value (§4.2) - is 
consistent with the extrapolation to the low-mass regime 
of the Mbh — & relation derived from black holes more 
massive than ~ 10 6 M Q . 
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7. CONCLUSIONS 



We discussed new HST ACS/HRC images and STIS 
spectra for the nuclear region of NGC 205, a nucleated 
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